Using the structural content of documents to automatically generate quality metadata
نویسنده
چکیده
....................................................................................................................... i Preface ........................................................................................................................ iii Acknowledgements .................................................................................................... v
منابع مشابه
An Approach To Automatically Generate Digital Library Image Metadata For Semantic And Content- Based Retrieval
Metadata represents textual information attached to an image or resource to aid identification and retrieval of that resource. In this paper it is revealed an approach to automate the creation of digital library image metadata embedding semantic and content features. These features will make more precise image content indexing and will allow fast retrieval of images in digital libraries, based ...
متن کاملAutomatic metadata mining from multilingual enterprise content
Personalization is increasingly vital especially for enterprises to be able to reach their customers. The key challenge in supporting personalization is the need for rich metadata, such as metadata about structural relationships, subject/concept relations between documents and cognitive metadata about documents (e.g. difficulty of a document). Manual annotation of large knowledge bases with suc...
متن کاملA Tool for Semi-Automatic Generation and Maintenance of Taxonomies from Semi-Structured Documents
This chapter introduces OntoExtractor, a tool for the semi-automatic generation of the taxonomy from a set of documents or data sources. The tool generates the taxonomy in a bottom-up fashion. Starting from structural analysis of the documents, it produces a set of clusters, which can be refined by a further grouping created by content analysis. Metadata describing the content of each cluster i...
متن کاملTemplate for Regular Entry
DEFINITION The widespread search engines, in the professional as well as the personal context, used to work on the basis of textual information associated or extracted from indexed documents. Nowadays, most of the exchanged or stored documents have multimedia content. To reduce the technological gap so that these engines still can work on multimedia content, it is very convenient developing met...
متن کاملKnowledge Retrieval and the World Wide Web
L ARGE-SCALE WEB SEARCH engines effectively retrieve entire documents, but they are imprecise, because they do not exploit and hence retrieve the semantic Web document content. We cannot automatically extract such content from general documents yet. Manually structuring Web documents— for example, with XML—lets us retrieve more precise information using stringand structure-matching tools, such ...
متن کامل